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Fast detection of communication patterns in distributed executions j 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: ^£| pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 



2 Gettin g and giving information: What is this text about? jjj^ 
Nicolas Hernandez, Brigitte Grau 

October 2003 Proceedings of the 21st annual international conference on 
Documentation 

Full text available: ^ pdf(229.96 KB) Additional Information: full citation , abstract , references , index terms 

Most work in text retrieval aims at presenting the information held by several texts in order 
to give entry clues towards these texts and to allow a navigation between them. Besides, a 
lesser interest is dedicated to the definition of principles for accessing content of single 
documents. As most information retrieval systems return documents from an initial request 
made of words, a usual solution consists of presenting document titles and highlighting 
words of the request inside a passage or in ... 

Keywords: dynamic summarization, meta-descriptors and topical descriptors identification, 
text structure, text visualization 



An efficient meta-lock for implementing ubiquitous synchronization 

Ole Agesen, David Detlefs, Alex Garthwaite, Ross Knippel, Y. S. Ramakrishna, Derek White 

October 1999 ACM SXGPLAN Notices , Proceedings of the 14th ACM SIGPLAN 

conference on Object-oriented programming, systems, languages, and 

applications, volume 34 issue 10 
Full text available- fi3 odf(2 00 MB) Additional Information: full citation , abstract , references , citings, index 

terms 

Programs written in concurrent object-oriented languages, especially ones that employ 
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thread-safe reusable class libraries, can execute synchronization operations (lock, notify, 
etc.) at an amazing rate. Unless implemented with utmost care, synchronization can 
become a performance bottleneck. Furthermore, in languages where every object may have 
its own monitor, per-object space overhead must be minimized. To address these concerns, 
we have developed a meta-lock to mediate access to synchro ... 

Keywords: concurrent threads, object-oriented language implementation, synchronization 



4 S pecial issue: Al in engineering \ 
D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin, issue 91 

Full text available: ^ pdf(8.79 MB) Additional Information: full citation , abstract 

The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown in this area is reflected in the sixty papers received from over six countries. 
About half the papers were received over the computer network. 

5 S pecial issue on knowledge representation j 
Ronald J. Brachman, Brian C. Smith 

February 1980 ACM SIGART Bulletin, issue 70 

Full text available: ^ pdf(13.13 MB) Additional Information: full citation , abstract 

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter devoted 
to a survey of current knowledge representation research. We felt that there were twe 
useful functions such an issue could serve. First, we hoped to elicit a clear picture of how 
people working in this subdiscipline understand knowledge representation research, to 
illuminate the issues on which current research is focused, and to catalogue what 
approaches and techniques are currently being developed. Secon ... 

6 A knowled g e-based electronic information and documentation system | 
Robert L. Young, Elaine Kant, Larry A. Akers 

January 2000 Proceedings of the 5th international conference on Intelligent user 
interfaces 

Fuli text available- f 8 ! Ddf(795 61 KB) Additional Information: full citation , abstract , references , citings , index 
U V 1 " T^H 3 — : terms 

We describe the capabilities of a knowledge-based system to automatically generate a 
collection of electronic notebooks containing various forms of online documentation and 
reports. This system is a subsystem of a larger knowledge-based system called SciNapse. 
SciNapse's raison d'etre is to transform high-level simulation problem specifications into 
executable numerical programs. The electronic notebooks are generated from the same 
domain knowledge bases that the system uses to perform its ... 

Keywords: intelligent interfaces, knowledg-based systems 
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Interactive Editing Systems: Part II 
Norman Meyrowitz, Andries van Dam 

September 1982 ACM Computing Surveys (CSUR), volume 14 issue 3 

Full text available: Q pdf(9.17 MB) Additional Information: full citation , references , citings, index terms 
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Sargur N. Srihari 

November 1986 Proceedings of 1986 ACM Fall joint computer conference 

Full text available: f^ pdf(1.38 MB) Additional Information: full citation , references , citings , index terms 



9 A unified framework for model-based clustering 
Shi Zhong, Joydeep Ghosh 

December 2003 The Journal of Machine Learning Research, volume 4 
Full text available: ^| pdf(851.48 KB) Additional Information: full citation , abstract , index terms 

Model-based clustering techniques have been widely used and have shown promising 
results in many applications involving complex data. This paper presents a unified 
framework for probabilistic model-based clustering based on a bipartite graph view of data 
and models that highlights the commonalities and differences among existing model-based 
clustering algorithms. In this view, clusters are represented as probabilistic models in a 
model space that is conceptually separate from the data space. For ... 

10 PCCTS reference manual: version 1.00 
T. J. Parr, H. G. Dietz, W. E. Cohen 
February 1992 ACM SIGPLAN Notices, volume 27 issue 2 

Full text available: ^ pdf(3.77 MB) Additional Information: full citation , citings , index terms 





11 Document formatting: Inter and intra media-object QoS provisioning in adaptive 
formatters 

Rogerio Ferreira Rodrigues, Luiz Fernando Gomes Soares 

November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: ■ m P df(483.39 KB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

The development of hypermedia/multimedia systems requires the implementation of an 
element, usually known as formatter, which is in charge of receiving the specification of a 
document (structure, media-object relationships and presentation descriptions) and 
controlling its presentation. The process of controlling and maintaining the presentation of a 
hyperdocument with an output of acceptable quality is a QoS orchestration problem, which 
needs to be treated by formatters in two related le ... 

Keywords: hyperProp system, hypermedia formatter, media synchronization, quality of 
service 



12 Curriculum 68: Recommendations for academic programs in computer science: a 
re port of the ACM curriculum committee on computer science 

William F. Atchison, Samuel D. Conte, John W. Hamblen, Thomas E. Hull, Thomas A. Keenan, 
William B. Kehl, Edward J. McCluskey, Silvio O. Navarro, Werner C. Rheinboldt, Earl J. 
Schweppe, William Viavant, David M. Young 
March 1968 Communications of the ACM, volume n issue 3 

Full text available: pdf(6.63 MB) Additional Information: full citation , references , citings 



Keywords: computer science academic programs, computer science bibliographies, 
computer science courses, computer science curriculum, computer science education, 
computer science graduate programs, computer science undergraduate programs 
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13 Document access and understanding: Methods for the semantic analysis of document Q 
markup 

Petra Saskia Bayerl, Harald Lungen, Daniela Goecke, Andreas Witt, Daniel Naber 
November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: ^ pdf(230.61 KB) Additional Information: full citation , abstract , references , index terms 

We present an approach on how to investigate what kind of semantic information is 
regularly associated with the structural markup of scientific articles. This approach 
addresses the need for an explicit formal description of the semantics of text-oriented XML- 
documents. The domain of our investigation is a corpus of scientific articles from 
psychology and linguistics from both English and German online available journals. For our 
analyses, we provide XML-markup representing two kinds of semantic ... 

Keywords: XML, information extraction, prolog, semantic analysis 



14 Translator writing systems 
Jerome Feldman, David Gries 

February 1968 Communications of the ACM, volume n issue 2 

Full text available: ^| pdf(4.47 MB) Additional Information: full citation , abstract , references , citings 

A critical review of recent efforts to automate the writing of translators of programming 
languages is presented. The formal study of syntax and its application to translator writing 
are discussed in Section II. Various approaches to automating the postsyntactic (semantic) 
aspects of translator writing are discussed in Section III, and several related topics in 
Section IV. 

Keywords: compiler compiler-compiler, generator, macroprocessor, meta-assembler, 
metacompiler, parser, semantics, syntactic analysis, syntax, syntax-directed, translator, 
translator writing system 



15 Workshop on compositional software architectures: workshop report 
May 1998 ACM SIGSOFT Software Engineering Notes, volume 23 issue 3 

Full text available: ^pdf(2.91 MB) Additional Information: full citation , index terms 



16 EMACS the extensible, customizable self-documenting display editor 
Richard M. Stallman 

June 1981 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN SIGOA 

symposium on Text manipulation, volume 16 issue 6 
Full text available: ^) pdf(H3MB) Additional Information: full citation , abstract , citings , index terms 

EMACS is a display editor which is implemented in an interpreted high level language. This 
allows users to extend the editor by replacing parts of it, to experiment with alternative 
command languages, and to share extensions which are generally useful. The ease of 
extension has contributed to the growth of a large set of useful features. This paper 
describes the organization of the EMACS system, emphasizing the way in which extensibility 
is achieved and used. This report describe ... 

17 Technical reports 
SIGACT News Staff 

January 1980 ACM SIGACT News, Volume 12 Issue l 
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18 Efficient filtering of XML documents with XPath expressions 
C.-Y. Chan, P. Felber, M. Garofalakis, R. Rastogi 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume ll Issue 4 
Full text available: ^ pdf(383.34 KB) Additional Information: full citation , abstract , citings , index terms 

The publish/subscribe paradigm is a popular model for allowing publishers (i.e., data 
generators) to selectively disseminate data to a large number of widely dispersed 
subscribers (i.e., data consumers) who have registered their interest in specific information 
items. Early publish/subscribe systems have typically relied on simple subscription 
mechanisms, such as keyword or "bag of words" matching, or simple comparison predicates 
on attribute values. The emergence of XML as a standar ... 

Keywords: Data dissemination, Document filtering, Index structure, XML, XPath 



19 Multimedia document architecture (panel session) 

Stephen Bulick, Terry Crowley, Lester Ludwig, Jonathan Rosenberg 
August 1990 ACM SIGGRAPH 90 Panel Proceedings 

Full text available: ^ pdf(4.35 MB) Additional Information: full citation , index terms 





20 Z - the 95% pro g ram editor 
Steven R. Wood 

June 1981 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN SIGOA 

symposium on Text manipulation, volume 16 issue 6 
Full text available* fiD pdf(757.05 KB) Additional Information: full citation , abstract , references , citing s, index 
' ^ terms 

Recently much attention has been focused on structure-oriented program editors that have 
specific knowledge about the syntax and semantics of a particular programming language 
[1, 4, 5, 18], These editors provide many desirable features for editing programs. However, 
the user interface is constrained by the syntax and semantics of the target language, and 
editing operations that are simple in a text editor can be quite complicated in a structure- 
oriented editor. In addition, the user has an ... 
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1 Industry session 1: knowledge management. and semantics: Thematic mapping - from 
unstructured documents to taxonomies 

Christina Yip Chung, Raymond Lieu, Jinhui Liu, Alpha Luk, Jianchang Mao, Prabhakar Raghavan 
November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: ^| pdf(202.36 KB) Additional Information: full citation , abstract , references , index terms 

Verity Inc. has developed a comprehensive suite of tools for accurately and efficiently 
organizing enterprise content which involves four basic steps: (i) creating taxonomies, (ii) 
building classification models, (iii) populating taxonomies with documents, and (iv) 
deploying populated taxonomies in enterprise portals. A taxonomy is a hierarchical 
representation of categories. A taxonomy provides a navigation structure for exploring and 
understanding the underlying corpus without sifting through ... 

Keywords: clustering and labeling, concept discovery, concept tree construction and 
visualization, conceptual search, thematic mapping 



2 Enhanced hypertext categorization using hyperlinks 
Soumen Chakrabarti, Byron Dom, Piotr Indyk 

June 1998 ACM SIGMOD Record , Proceedings of the 1998 ACM SIGMOD international 

conference on Management of data, volume 27 issue 2 
Full text available: 1g| pdf(1.91 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

A major challenge in indexing unstructured hypertext databases is to automatically extract 
meta-data that enables structured search using topic taxonomies, circumvents keyword 
ambiguity, and improves the quality of search and profile-based routing and filtering. 
Therefore, an accurate classifier is an essential component of a hypertext database. 
Hyperlinks pose new problems not addressed in the extensive text classification literature. 
Links clearly contain high-quality semantic clues that ... 

3 Research track: SEWeP: using site semantics and a taxonomy to enhance the Web 
personalization process 

M. Eirinaki, M. Vazirgiannis, I. Varlamis 

August 2003 Proceedings of the ninth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available- Additional Information: full citation , abstract , references , citings , index 
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pdf(429.65 KB) terms 

Web personalization is the process of customizing a Web site to the needs of each specific 
user or set of users, taking advantage of the knowledge acquired through the analysis of 
the user's navigational behavior. Integrating usage data with content, structure or user 
profile data enhances the results of the personalization process. In this paper, we present 
SEWeP, a system that makes use of both the usage logs and the semantics of a Web site's 
content in order to personalize it. Web content is ... 

Keywords: Web mining, Web personalization, concept hierarchies, semantic annotation of 
Web content 



E-Design Based on the Reuse Paradigm 

L. Ghanmi, A. Ghrab, M. Hamdoun, B. Missaoui, K. Skiba, G. Saucier 

March 2002 Proceedings of the conference on Design, automation and test in Europe 

Full text available: fij pdf(244.78 KB) 

Jsf Additional Information: full citation , abstract , citings 

^ & Publisher Site 

This paper gives an overview on a Virtualelectronic component or IP (Intellectual Property) 
exchange infrastructure whose main components area XML "well structured IP e-catalog 
Builder <L"and a" XML IP profilers While the first module is ae_publishing and an exchange 
management modulethe second has as role to extract from the designdirectories the IP files 
and to trigger their transferto the user site possibly via an IP distribution serverunder the 
catalog control. Direct Design fileextraction fro ... 

Informatics: dialogue: The impatient tutor: an integrated language understanding 
system 

Brian Phillips, James Hendler 

September 1980 Proceedings of the 8th conference on Computational linguistics 

Full text available: ^ pdf(563.00 KB) Additional Information: full citation , abstract , references 

We describe a language understanding system that uses the techniques of segmenting the 
computation into autonomous modules that communicate by message passing. The goal is 
to integrate semantic and syntactic processing to achieve greater flexibility and robustness 
in the design of language understanding systems. 

A message-passing control structure for text understanding 
Brian Phillips, James A. Hendler 

July 1982 Proceedings of the 9th conference on Computational linguistics - Volume 1 

Full text available: ^ pdf(267.18 KB) Additional Information: full citation , abstract , references 

This paper describes an object-oriented, message-passing system for natural language text 
understanding. The application domain is the texts of Texas Instruments' patent 
descriptions. The object-oriented environment permits syntactic analysis modules to 
communicate with domain knowledge modules to resolve ambiguities as they arise. 

On the merits of building categorization systems by supervised clustering 
Charu C. Aggarwal, Stephen C. Gates, Philip S. Yu 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: ^] pdf(618.12 KB) Additional Information: full citation , references , citings, index terms 
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text databases into hierarchical topic taxonomies 

Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan 

August 1998 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 7 Issue 3 

Full text available: ^pdf(281.37 KB) Additional Information: full citation , abstract , citings , index terms 

We explore how to organize large text databases hierarchically by topic to aid better 
searching, browsing and filtering. Many corpora, such as internet directories, digital 
libraries, and patent databases are manually organized into topic hierarchies, also called 
taxonomies. Similar to indices for relational data, taxonomies make search and access more 
efficient. However, the exponential growth in the volume of on-line textual information 
makes it nearly impossible to maintain such taxono ... 

9 Poster papers: PathScape: indexing audio-visual digital media 
Mike Leggett 

April 2005 Proceedings of the 5th conference on Creativity & cognition 

Full text available: ^ pdf(515.14 KB) Additional Information: full citation , abstract , references , index terms 

The contemporary burgeoning usage of digital movies, photos, audio and text, their 
distribution through networks both electronic and physical, will be considered in the context 
of a convergence of these media with a contemporary engagement with personal and 
community history. An interactive experimental prototype, PathScape, will be described and 
evaluated and further practice-based research approaches to author-defined storage and 
retrieval systems will be outlined. 

Keywords: digital media, index, interactive, taxonomy 




10 Total order broadcast and multicast algorithms: Taxonomy and survey 
Xavier Defago, Andre Schiper, Peter Urban 

December 2004 ACM Computing Surveys (CSUR), volume 36 issue 4 

Full text available: ^[ pdf(544.45 KB) Additional Information: full citation , abstract , references , index terms 

Total order broadcast and multicast (also called atomic broadcast/multicast) present an 
important problem in distributed systems, especially with respect to fault-tolerance. In 
short, the primitive ensures that messages sent to a set of processes are, in turn, delivered 
by all those processes in the same total order. 

Keywords: Distributed systems, agreement problems, atomic broadcast, atomic multicast, 
classification, distributed algorithms, fault-tolerance, global ordering, group communication, 
message passing, survey, taxonomy, total ordering 




11 A data dictionary as a Lexicon: an application of linguistics in information systems 
J. F. M. Burg, R. P. van de Riet, S. C. Chang 

December 1993 Proceedings of the second international conference on Information and 
knowledge management 

Full text available: ^pdf(826.11 KB) Additional Information: full citation , references , index terms 



12 Conceptual analysis of lexical taxonomies: the case of WordNet top-level Q 
Aldo Gangemi, Nicola Guarino, Alessandro Oltramari 

October 2001 Proceedings of the international conference on Formal Ontology in 
Information Systems - Volume 2001 

Full text available: ^C |pdf(1.27 MB) Additional Information: full citation , abstract , references , citin gs, index 
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terms , review 

In this paper we propose an analysis and an upgrade of WordNet's top-level synset 
taxonomy. We briefly review WordNet and Identify its main semantic limitations. Some 
principles from a forthcoming OntoClean methodology are applied to the ontological 
analysis of WordNet. A revised top-level taxonomy is proposed, which is meant to be more 
conceptually rigorous, cognitively transparent, and efficiently exploitable in several 
applications. 

Keywords: WordNet, ontology, taxonomies, top-level 



1 3 Contribution of a category hierarchy to the robustness of syntactic parsing. 
Damien Genthial, Jacques Courtin, Irene Kowarski 

August 1990 Proceedings of the 13th conference on Computational linguistics - Volume 
2 

Full text available: ^ pdf(478.07 KB) Additional Information: full citation , abstract , references , citings 

We describe how the use of a hierarchy of lexical categories instead of a simple set of 
categories leads to the definition of a flexible and precise language for the description of 
dependency structures. After specifying the formalism we use to decorate these structures, 
we present an application aiming to detect and correct errors in a written text. We outline 
how the use of the hierarchy improves the manipulation of unknown words. 

14 Posters: An agent system reasoning about the web and the user 
Giovambattista Ianni, Francesco Ricca, Francesco Calimeri, Vincenzino Lio, Stefania Galizia 
May 2004 Proceedings of the 13th international World Wide Web conference on 

Alternate track papers & posters 

Full text available: ^ pdf(43.02 KB) Additional Information: full citation , abstract , references , index terms 

The paper describes some innovations related to the ongoing work on the GSA prototype, 
an integrated information retrieval agent. In order to improve the original system 
effectiveness, we propose the GSA2 system, introducing a new internal architecture based 
on a message-passing framework and on an ontology description formalism (WOLF, Web 
ontology Framework). GSA2 is conceived in order to describe and easily perform reasoning 
on "facts about the web and the user". The most ... 

Keywords: agents, answer set programming, information retrieval, logic programming 



1 5 Query result processing: A hierarchical monothetic document clustering algorithm for Q 
summarization and browsing search results 

Krishna Kummamuru, Rohit Lotlikar, Shourya Roy, Karan Singal, Raghu Krishnapuram 
May 2004 Proceedings of the 13th international conference on World Wide Web 

Full text available: ^ | pdf(446.94 KB) Additional Information: full citation , abstract , references , index terms 

Organizing Web search results into a hierarchy of topics and sub-topics facilitates browsing 
the collection and locating results of interest. In this paper, we propose a new hierarchical 
monothetic clustering algorithm to build a topic hierarchy for a collection of search results 
retrieved in response to a query. At every level of the hierarchy, the new algorithm 
progressively identifies topics in a way that maximizes the coverage while maintaining 
distinctiveness of the topics. We refer the pro ... 

Keywords: automatic taxonomy generation, clustering, data mining, search, 
summarization 
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16 Data mining (DM): Framework for mining web content outliers 
Malik Agyemang, Ken Barker, Reda Alhajj 

March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: ^) pdfd 38.47 KB) Additional Information: full citation , abstract , references , index terms 

Outliers are data objects with different characteristics compared to other data objects. 
Exploring the diverse and dynamic web data for outliers is more interesting than finding 
outliers in numeric data sets. Interestingly, the existing web mining algorithms have 
concentrated on finding patterns that are frequent while discarding the less frequent ones 
that are likely to contain the outlying data. This paper refers to outliers present on the web 
as web outliers to distinguish them from tr 

Keywords: content-specific algorithms, data mining, resource extraction, web outliers 



17 S pecial issue: Al in engineering Q 
D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin, issue 91 

Full text available: ^pdf(8.79MB) Additional Information: full citation , abstract 

The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown in this area is reflected in the sixty papers received from over six countries. 
About half the papers were received over the computer network. 

18 Full papers: Losers and finders: indexing audio-visual digital media 
Mike Leggett 

April 2005 Proceedings of the 5th conference on Creativity & cognition 

Full text available: ^ pdf(323.17 KB) Additional Information: full citation , abstract , references , index terms 

The contemporary burgeoning usage of digital movies, photos, audio and text, their 
distribution through networks both electronic and physical will be considered in the context 
of a convergence of these media with a popular interest in personal and community history 
and identity.The paper introduces interdisciplinary research into human memory as a 
context for understanding its relation to machine memory and methods of storing and 
retrieval. It proposes an approach to indexing audio-visual media ... 

Keywords: digital media, index, interactive, taxonomy 



1 9 Industry track: Intelligent agent for automated manufacturing rule generation Q 
Alan Clark, Dimitar Filev 

November 2004 Proceedings of the Thirteenth ACM conference on Information and 
knowledge management 

Full text available: ^] pdf(247.85 KB) Additional Information: full citation , references , index terms 



Keywords: KBE, clustering algorithms, intelligent agent, knowledge extraction, knowledge 
management, latent semantic indexing 



20 Text classification: Parameterized generation of labeled datasets for text categorization Q 
based on a hierarchical directory 
Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovitch 

July 2004 Proceedings of the 27th annual international conference on Research and 
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development in information retrieval 

Full text available: fflpdff 164.60 KB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

Although text categorization is a burgeoning area of IR research, readily available test 
collections in this field are surprisingly scarce. We describe a methodology and system 
(named ACCIO) for automatically acquiring labeled datasets for text categorization from the 
World Wide Web, by capitalizing on the body of knowledge encoded in the structure of 
existing hierarchical directories such as the Open Directory. We define parameters of 
categories that make it possible to acquire nu ... 
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41 Personalized spiders for web search and analy sis 
Michael Chau, Daniel Zeng, Hinchun Chen 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: 



Searching for useful information on the World Wide Web has become incr easingly difficult. 
While Internet search engines have been helping people to search on the web, low recall 
rate and outdated indexes have become more and more problematic as the web grows. In 
addition, search tools usually present to the user only a list of search results, failing to 
provide further personalized analysis which could help users identify useful information and 
comprehend these results. To alleviate these ... 

Keywords: information retrieval, internet searching and browsing, internet spider, noun- 
phrasing, personalization, self-organizing map 



42 Natural language information retrieval in digital libraries 
Tomek Strzalkowski, Jose Perez-Carballo, Mihnea Marinescu 

April 1996 Proceedings of the first ACM international conference on Digital libraries 

Full text available: ^| pdf(1.03 MB) Additional Information: full citation , references , index terms 



43 Document Examiner: delivery interface for hypertext documents 
Janet H. Walker 

November 1987 Proceeding of the ACM conference on Hypertext 

Full text available- fid pdf(1 28 MB) Additional Information: full citation , abstract , references , citings , index 
' ^ terms 

This paper describes the user interface strategy of Document Examiner, a delivery interface 
for commercial hypertext documents. Unlike many hypertext interfaces, Document 
Examiner does not adopt the directed graph as its fundamental user-visible navigation 
model. Instead it offers context evaluation and content-based searching capabilities that 
are based on consideration of the strategies that people use in interacting with paper 
documents. 

44 Summarization-based query expansion in information retrieval 
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Tomek Strzalkowski, Jin Wang, Bowden Wise 
August 1998 

Full text available: fi Q pdf(726.07 KB) 
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We discuss a semi-interactive approach to information retrieval which consists of two tasks 
performed in a sequence. First, the system assists the searcher in building a comprehensive 
statement of information need, using automatically generated topical summaries of sample 
documents. Second, the detailed statement of information need is automatically processed 
by a series of natural language processing routines in order to derive an optimal search 
query for a statistical information retrieval sys ... 

45 A unified environment for fusion of information retrieval approaches 
M. Catherine McCabe, Abdur Chowdhury, David A. Grossman, Ophir Frieder 
November 1999 Proceedings of the eighth international conference on Information and 
knowledge management 

Full text available* f £l pdf(566 22 KB) Additional Information: full citation , abstract , references , citings , index 

1 terms 

Prior work has shown that combining results of various retrieval approaches and query 
representations can improve search effectiveness. Today, many meta-search engines exist 
which combine the results of various search engines in the hopes of improving overall 
effectiveness. However, the combination of results from different search engines masks 
variations in parsers, and other indexing techniques (stemming, stop words, etc.) This 
makes it difficult to assess the utility of the fusion techni ... 

Keywords: fusion, information retrieval, metasearch, retrieval, text 



4 6 An intelligent multilingual information browsing and retrieval system using information Q 
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Chinatsu Aone, Nicholas Charocopos, James Gorlinsky 0 

March 1997 Proceedings of the fifth conference on Applied natural language 
processing 



Full text available: 




Additional Information: full citation , abstract , references , citings 



Publisher Site 

In this paper, we describe our multilingual (or cross-linguistic) information browsing and 
retrieval system, which is aimed at monolingual users who are interested in information 
from multiple language sources. The system takes advantage of information extraction (IE) 
technology in novel ways to improve the accuracy of cross-linguistic retrieval and to provide 
innovative methods for browsing and exploring multilingual document collections. The 
system indexes texts in different languages ( ... 



47 Placing search in context: the concept revisited 

Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, 
Eytan Ruppin 

April 2001 Proceedings of the 10th international conference on World Wide Web 

Full text available: pdf(235.96 KB) Additional Information: full citation , references , citings , index terms 



Keywords: context, invisible web, search, semantic processing, statistical natural language 
processing 
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48 Integrating automatic genre analysis into digital libraries 
Andreas Rauber, Alexander Muller-Kogler 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries 

i- ii* ^ i u. a , /e7on0 uD\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TCI pdf(672.98 KB) 

" terms 

With the number and types of documents in digital library systems incr easing, tools for 
automatically organizing and presenting the content have to be found. While many 
approaches focus on topic-based organization and structuring, hardly any system 
incorporates automatic structural analysis and representation. Yet, genre information 
(unconsciously) forms one of the most distinguishing features in conventional libraries and 
in information searches. In this paper we present an approach to au ... 

Keywords: SOMLib, document clustering, genre analysis, metaphor graphics, self- 
organizing map (SOM), visualization 



49 Using properties for uniform interaction in the Presto document system 
Paul Dourish, W. Keith Edwards, Anthony LaMarca, Michael Salisbury 
November 1999 Proceedings of the 12th annual ACM symposium on User interface 
software and technology 

Full text available- S pdf(477 56 KB) Additional Information: full citation , abstract , references , citings, index 

1 terms 

Most document or information management systems rely on hierarchies to organise 
documents (e.g. files, email messages or web bookmarks). However, the rigid structures of 
hierarchical schemes do not mesh well with the more fluid nature of everyday document 
practices. This paper describes Presto, a prototype system that allows users to organise 
their documents entirely in terms of the properties those documents hold for users. 
Properties provide a uniform mechanism for managing, coding, searc ... 

Keywords: document interfaces, document management, document properties, interaction 
models 



50 Scalable collection summarization and selection 
R. Dolin, D. Agrawal, E. El Abbadi 

August 1999 Proceedings of the fourth ACM conference on Digital libraries 

Full text available: ^ pdf(263.57 KB) Additional Information: full citation , references , citing s, index terms 
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51 Emergent web patterns: Automatically sharing web experiences throu g h a 
h yperdocument recommender system 

Alessandra Alaniz Macedo, Khai N. Truong, Jose Antonio Camacho-Guerrero, Maria da Gra£a 
Pimentel 

August 2003 Proceedings of the fourteenth ACM conference on Hypertext and 
hypermedia 

Full text available: fflpdf(620.88 KB) Additlonal Information: full citation , abstract, references , cjtings, index 

terms 

As an approach that applies not only to support user navigation on the Web, recommender 
systems have been built to assist and augment the natural social process of asking for 
recommendations from other people. In a typical recommender system, people provide 
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suggestions as inputs, which the system aggregates and directs to appropriate recipients. 
In some cases, the primary computation is in the aggregation; in others, the value of the 
system lies in its ability to make good matches between the re ... 

Keywords: information retrieval, open hypermedia, recommender systems, semantic 
structures, web 



52 Data structures for efficient broker implementation 

Anthony Tomasic, Luis Gravano, Calvin Lue, Peter Schwarz, Laura Haas 

July 1997 ACM Transactions on Information Systems (TOIS), volume is issue 3 

Full text available* f 3pdf(316 45 KB) Additional Information: full citation , abstract , references , citings , index 
• : terms , review 

With the profusion of text databases on the Internet, it is becoming increasingly hard to 
find the most useful databases for a given query. To attack this problem, several existing 
and proposed systems employ brokers to direct user queries, using a local database of 
summary information about the available databases. This summary information must 
effectively distinguish relevant databases and must be compact while allowing efficient 
access. We offer evidence that one broker, GIOSS 

Keywords: GIOSS, broker architecture, broker performance, distributed information, grid 
files, partitioned hashing 



53 Analysis of lexical si g natures for improving information persistence on the World Wide Q 
Web 

Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz 

October 2004 ACM Transactions on Information Systems (TOIS), Volume 22 issue 4 

. Full text available: ^ pdf(8Q8.10 KB) Additional Information: full citation , abstract , references , index terms 

A <i>lexical signature</i> (LS) consisting of several key words from a Web document is 
often sufficient information for finding the document later, even if its URL has changed. We 
conduct a large-scale empirical study of nine methods for generating lexical signatures, 
including Phelps and Wilensky's original proposal (PW), seven of our own static variations, 
and one new dynamic method. We examine their performance on the Web over a 10-month 
period, and on a TREC data set, evaluating t ... 

Keywords: Broken URLs, TREC, World Wide Web, dead links, digital libraries, indexing, 
information retrieval, inverse document frequency, lexical signatures, robust hyperlinks, 
search engines, term frequency 



54 Mobile data management: Mimic: raw activity shippin g for file synchronization in mobile Q 
file systems 

Tae-Young Chang, Aravind Velayutham, Raghupathy Sivakumar 
June 2004 Proceedings of the 2nd international conference on Mobile systems, 
applications, and services 

Full text available: ^ pdf(334.54 KB) Additional Information: full citation , abstract , references , index terms 

In this paper, we consider the problem of file synchronization when a mobile host shares 
files with a backbone file server in a network file system. Several diff schemes have been 
proposed to improve upon the transfer overheads of conventional file synchronization 
approaches which use full file transfer. These schemes compute the binary diff of the new 
file with respect to the old copy at the server and transfer the computed diff to the server 
for file-synchronization. Howev ... 

Keywords: file synchronization, mobile file system, raw activity shipping 
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55 Text categorization for multi-page documents: a hybrid naive Bayes HMM approach 
Paolo Frasconi, Giovanni Soda, Alessandro Vullo 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries 

Full text available: ^ pdf(280.05 KB) Additional Information: full citation , abstract , references , index terms 

Text categorization is typically formulated as a concept learning prob lem where each 
instance is a single isolated document. In this paper we are interested in a more general 
formulation where documents are organized as page sequences, as naturally occurring in 
digital libraries of scanned books and magazines. We describe a method for classifying 
pages of sequential OCR text documents into one of several assigned categories and 
suggest that taking into account contextual information provid ... 

Keywords: hidden Markov models, multi-page documents, naive Bayes classifier, text 
categorization 



56 Constant interaction-time scatter/gather browsing of very large document collections 
Douglass R. Cutting, David R. Karger, Jan O. Pedersen 

July 1993 Proceedings of the 16th annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: ■ m P df(798.85 KB) Additional Information: full citation , abstract, references , citings, index 
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The Scatter/Gather document browsing method uses fast document clustering to produce 
table-of-contents-like outlines of large document collections. Previous work [1] developed 
linear-time document clustering algorithms to establish the feasibility of this method over 
moderately large collections. However, even linear-time algorithms are too slow to support 
interactive browsing of very large collections such as Tipster, the DARPA standard text 
retrieval evaluation collection. We present a sc ... 

57 S pecial issue on knowledge representation 
Ronald J. Brachman, Brian C. Smith 
February 1980 ACM SIGART Bulletin, issue 70 

Full text available: ^| pdf(13.13 MB) Additional Information: full citation , abstract 

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter devoted 
to a survey of current knowledge representation research. We felt that there were twe 
useful functions such an issue could serve. First, we hoped to elicit a clear picture of how 
people working in this subdiscipline understand knowledge representation research, to 
illuminate the issues on which current research is focused, and to catalogue what 
approaches and techniques are currently being developed. Secon ... 

58 Ex periences with selecting search engines using metasearch 
Daniel Dreilinger, Adele E. Howe 

July 1997 ACM Transactions on Information Systems (TOIS), volume 15 issue 3 

Full text available: fj ft pdf(428.65 KB) Additiona l Information: full citation , abstract , references , citings , index 

terms , review 

Search engines are among the most useful and high-profile resources on the Internet. The 
problem of finding information on the Internet has been replaced with the problem of 
knowing where search engines are, what they are designed to retrieve, and how to use 
them. This article describes and evaluates SavvySearch, a metasearch engine designed to 
intelligently select and interface with multiple remote search engines. The primary 
metasearch issue examined is the importance of carefully selecti ... 
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59 Text categorization and retrieval: Robust text processing in automated information Q 
retrieval 

Tomek Strzalkowski 

October 1994 Proceedings of the fourth conference on Applied natural language 
processing 

Full text available: fg |pdf(593.70 KB) 

Additional Information: full citation , abstract , references , citings 

' Publisher Site 

We report on the results of a series of experiments with a prototype text retrieval system 
which uses relatively advanced natural language processing techniques in order to enhance 
the effectiveness of statistical document retrieval. In this paper we show that large-scale 
natural language processing (hundreds of millions of words and more) is not only required 
for a better retrieval, but it is also doable, given appropriate resources. In particular, we 
demonstrate that the use of syntactic compo ... 

60 Ex pansion of multi-word terms for indexing and retrieval using morphology and syntax Q 
Christian Jacquemin, Judith L. Klavans, Evelyne Tzoukermann 

July 1997 

Full text available: f gpdf(751.77 KB) 

Additional Information: full citation , abstract , references , citings 

^ Publisher Site 

A system for the automatic production of controlled index terms is presented using 
linguistically-motivated techniques. This includes a finite-state part of speech tagger, a 
derivational morphological processor for analysis and generation, and a unification-based 
shallow-level parser using transformational rules over syntactic patterns. The contribution 
of this research is the successful combination of parsing over a seed term list coupled with 
derivational morphology to achieve greater coverage ... 
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1 Hierarchical classification of Web content 
Susan Dumais, Hao Chen 

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available- f?l pdfd 16 MB) Additional Information: full citation , abstract , references , citings, index 
' ^ terms 

This paper explores the use of hierarchical structure for classifying a large, heterogeneous 
collection of web content. The hierarchical structure is initially used to train different 
second-level classifiers. In the hierarchical case, a model is learned to distinguish a second- 
level category from other categories within the same top level. In the flat non-hierarchical 
case, a model distinguishes a second-level category from all other second-level categories. 
Scoring rules can further take ad ... 

Keywords: Web hierarchies, classification, hierarchical models, machine learning, support 
vector machines, text catergorization, text classification 



2 The explor atio n of le g al text corpora with hi erarchical neural networks: a g uided tour in Q 
public international law 
Dieter Merkl, Erich Schweighofer 

June 1997 Proceedings of the 6th international conference on Artificial intelligence 
and law 

Full text available: ^ [pdfd.12 MB) Additional Information: full citation , references , citings , index terms 



3 Session 3: interfacing stored media I: In-home access to multimedia content 
Dario Teixeira, Yassine Faihe 

December 2002 Proceedings of the tenth ACM international conference on Multimedia 

Full text available: ^ pdf(1 94.34 KB) Additional Information: full citation , abstract , references 

This paper discusses the problem of content overload in the home environment. We 
describe the various domains where the problem manifests itself, and we analyse which 
user activities could benefit from software assistance. In particular, we focus on the 
problem of searching for information, and we introduce the concept of Conversational 
Search as the means to tackle it. We also introduce a framework to test several heuristics 
related to Conversational Search, and show how using a model of ... 
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Keywords: browsing assistants, content overload, conversational interfaces, 
conversational search, digital photography, home environments, multimedia, user modelling 



4 Query result processing: A hierarchical monothetic document clustering algorithm for J 
summarization and browsing search results 

Krishna Kummamuru, Rohit Lotlikar, Shourya Roy, Karan Singal, Raghu Krishnapuram 
May 2004 Proceedings of the 13th international conference on World Wide Web 

Full text available: ^| pdf(446.94 KB ) Additional Information: full citation , abstract , references , index terms 

Organizing Web search results into a hierarchy of topics and sub-topics facilitates browsing 
the collection and locating results of interest. In this paper, we propose a new hierarchical 
monothetic clustering algorithm to build a topic hierarchy for a collection of search results 
retrieved in response to a query. At every level of the hierarchy, the new algorithm 
progressively identifies topics in a way that maximizes the coverage while maintaining 
distinctiveness of the topics. We refer the pro ... 

Keywords: automatic taxonomy generation, clustering, data mining, search, 
summarization 



5 An explanation facility for a grammar writing system 
Loong Cheong Tong 

August 1990 Proceedings of the 13th conference on Computational linguistics 
2 

Full text available: ^ pdf(587.09 KB) Additional Information: full citation , abstract , references 

Explanation has become a standard feature in many expert systems today. Adapting from 
this work, a study was made to determine the types of explanation required in a grammar 
writing system and to investigate design and implementation issues. The first version of this 
explanation facility is based on a derivational history of the inferencing process, and 
although no supplementary knowledge is used, this explanation facility is able to furnish 
answers to the traditional why, how and what type o ... 

6 What is text, really? 

Steven J. DeRose, David G. Durand, Elli Mylonas, Allen H. Renear 

August 1997 ACM SIGDOC Asterisk Journal of Computer Documentation, Volume 21 issue 3 
Full text available: g pdf(1.20MB) Additional Information: full citation , abstract , citings, index terms 

THE WAY IN WHICH TEXT IS represented on a computer affects the kinds of uses to which 
it can be put by its creator and by subsequent users. The electronic document model 
currently in use is impoverished and restrictive. The authors argue that text is best 
represented as an ordered hierarchy of content object (OHCO), because that is what text 
really is. This model conforms with emerging standards such as SGML and contains within it 
advantages for the writer, publisher, and researcher. The authors ... 
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7 Bioinformatics (BIO): An architecture for biological information extraction and 
representation 

Aditya Vailaya, Peter Bluvas, Robert Kincaid, Allan Kuchinsky, Michael Creech, Annette Adler 
March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: ^) pdf(355.71 KB) Additional Information: full citation , abstract , references 

Technological advances in biomedical research are generating a plethora of heterogeneous 
data at a high rate. There is a critical need for extraction, integration and management 
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tools for information discovery and synthesis from these heterogeneous data. In this paper, 
we present a general architecture, called ALFA, for information extraction and 
representation from diverse biological data. The ALFA architecture consists of: (i) a 
networked, hierarchical object model for representing information ... 

Keywords: bioinformatics, filtering, heterogeneous data, information representation, 
information retrieval, interactive text mining, software architecture, user-guided 
information extraction 



8 A reusable, academic-strength, metrics-based software engineerin g process for 
ca pstone courses and projects 
Richard Conn 

March 2004 ACM SZGCSE Bulletin , Proceedings of the 35th SIGCSE technical 

symposium on Computer science education, volume 36 issue i 
Full text available: ^ pdf(223.31 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes a mature Software Engineering Process that may be applied to 
capstone courses, student projects, and research projects in a university environment. This 
process, based in part on the Team Software Process of the Software Engineering Institute, 
features mature software engineering best practices, including extensive use of metrics to 
gain insight into process effectiveness and product quality. It is designed to be executed in 
a single 16-week semester, and it can easily be mo ... 

Keywords: capability maturity model, communicating sequential process, data primitives, 
defect, personal software process, process improvement, software engineering institute, 
structured query language, team software process, unified modeling language, visual basic 



9 Getting and giving information; What is this text about? 
Nicolas Hernandez, Brigitte Grau 

October 2003 Proceedings of the 21st annual international conference on 
Documentation 

Full text available: ^ pdf(229.96 KB) Additional Information: full citation , abstract , references , index terms 

Most work in text retrieval aims at presenting the information held by several texts in order 
to give entry clues towards these texts and to allow a navigation between them. Besides, a 
lesser interest is dedicated to the definition of principles for accessing content of single 
documents. As most information retrieval systems return documents from an initial request 
made of words, a usual solution consists of presenting document titles and highlighting 
words of the request inside a passage or in ... 

Keywords: dynamic summarization, meta-descriptors and topical descriptors identification, 
text structure, text visualization 



10 Visibility culling using hierarchical occlusion maps 

Hansong Zhang, Dinesh Manocha, Tom Hudson, Kenneth E. Hoff 

August 1997 Proceedings of the 24th annual conference on Computer graphics and 
interactive techniques 

Full text available: fg pdf(597.69 KB) Additional Information: full citation , references , citings, index terms 



Keywords: hierarchical data structures, image pyramid, interactive display, occlusion 
culling, visibility culling 
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11 Industrial and practical experience track paper session 1: A personalized search 
engine based on web-snippet hierarchical clustering 
Paolo Ferragina, Antonio Gulli 

May 2005 Special interest tracks and posters of the 14th international conference on 
World Wide Web 

Full text available: ^ pdf(514.22 KB) Additional Information: full citation , abstract , references , index terms 

. In this paper we propose a hierarchical clustering engine, called snaket, that is able to 
organize on-the-fly the search results drawn from 16 commodity search engines into a 
hierarchy of labeled folders. The hierarchy offers a complementary view to the flat-ranked 
list of results returned by current search engines. Users can navigate through the hierarchy 
driven by their search needs. This is especially useful for informative, polysemous and poor 
queries. SnakeT is the first complete an ... 

Keywords: information extraction, new search applications and interfaces, personalized 
web ranking, search engines, web snippets clustering 



12 Book reviews: Abstracts of current literature 
Computational Linguistics Staff 

September 1989 Computational Linguistics, volume 15 issue 3 

Full text available: fa pdf(339.85 KB) 
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13 The structure of user-adviser dialogues: is there method in their madness? 
Raymonde Guindon, Paul Sladky, Hans Brunner, Joyce Conner 

July 1986 Proceedings of the 24th annual meeting on Association for Computational 
Linguistics 

Full text available: pdf(653.30 KB) 

|Sf Additional Information: full citation , abstract , references , citings 

tfp 1 Publisher Site 

Novice users engaged in task-oriented dialogues with an adviser to learn how to use an 
unfamiliar statistical package. The users' task was analyzed and a task structure was 
derived. The task structure was used to segment the dialogue into subdialogues associated 
with the subtasks of the overall task. The representation of the dialogue structure into a 
hierarchy of subdialogues, partly corresponding to the task structure, was validated by 
three converging analyses. First, the distribution of non- ... 

14 Special issue on knowledge representation 
Ronald J. Brachman, Brian C. Smith 
February 1980 ACM SIGART Bulletin, issue 70 

Full text available: fi fl pdf(13.13 MB) Additional Information: full citation , abstract 



In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter devoted 
to a survey of current knowledge representation research. We felt that there were twe 
useful functions such an issue could serve. First, we hoped to elicit a clear picture of how 
people working in this subdiscipline understand knowledge representation research, to 
illuminate the issues on which current research is focused, and to catalogue what 
approaches and techniques are currently being developed. Secon ... 



15 SIGART special issue on machine learning 
April 1981 ACM SIGART Bulletin, issue 76 
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Current research on Machine Learning encompasses a diverse set of approaches, and of 
opinions regarding where the important issues lie. The significant increase of interest and 
research activity in Machine Learning over the past few years has led us to organize this 
special issue of SIGART, whose purpose is to provide a snapshot of current research in this 
field. This issue contains a set of summaries of ongoing research, solicited from the 
community at large, and received from thirty-five resea ... 

1 6 Extractin g significant time varying features from text Q 
Russell Swan, James Allan 

November 1999 Proceedings of the eighth international conference on Information and 
knowledge management 



We propose a simple statistical model for the frequency of occurrence of features in a 
stream of text. Adoption of this model allows us to use classical significance tests to filter 
the stream for interesting events. We tested the model by building a system and running it 
on a news corpus. By a subjective evaluation, the system worked remarkably well: almost 
all of the groups of identified tokens corresponded to news stories and were appropriately 
placed in time. A preliminary objective eva ... 

17 Metamodeling in EIA/CDIF— meta-metamodel and metamodels Q 
Rony G. Flatscher 

October 2002 ACM Transactions on Modeling and Computer Simulation (TOMACS), 

Volume 12 Issue 4 



This article introduces the EIA/CDIF set of standards for the modeling of information 
systems and its exchange among computer-aided software tools of different vendors. It 
lays out the meta-metamodel and the standardized metamodels which are fully depicted in 
a hierarchical layout and annotated with the unique identifiers of all the standardized 
modeling concepts. The article also stresses the fact that EIA/CDIF has been used as the 
baseline in the creation of an international standard, the ISO/ ... 

Keywords: CASE Data Interchange Format, CDIF, EIA, clear text encoding, languages, 
meta-metamodels, metamodels, system design 
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July 1997 Proceedings of the second ACM international conference on Digital libraries 
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digital libraries, heterogeneity, interoperability, metadata architecture, metadata repository, 
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The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown in this area is reflected in the sixty papers received from oyer six countries. 
About half the papers were received over the computer network. 

20 Storing HyTime documents in an object-oriented databases j 
Klemens Bohm, Karl Aberer 

November 1994 Proceedings of the third international conference on Information and 
knowledge management 

Full text available* 1"! pdfd 03 MB) Additional Information: full citation , abstract , references , citing s, index 
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An open hypermedia-document storage system has to meet requirements that are not 
satisfied by existing systems: it has to support non-generic hypermedia document types, 
i.e. document types enriched with application-specific semantics. It has to provide 
hypermedia-document access methods. Finally, it has to allow the exchange of hypermedia 
documents with other systems. On a technical level, an object-oriented database- 
management system, on a logical level, a well established ISO standard, na ... 
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